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Abstract — Most design approaches for trellis-coded quantiza- 
tion take advantage of the duality of trellis-coded quantization 
with trellis-coded modulation, and use the same empirically- 
found convolutional codes to label the trellis branches. This letter 
presents an alternative approach that instead takes advantage of 
maximum-Hamming-distance convolutional codes. The proposed 
source codes are shown to be competitive with the best in the 
literature for the same computational complexity. 

Index Terms — Source coding, trellis-coded quantization, geo- 
metrically uniform codes, constellation labeling, distance-optimal 
convolutional codes. 



I. Introduction 

IT IS well known that trellis source coding asymptotically 
achieves the bounds of Shannon theory at the price of 
constructing (and storing) huge codebooks of random repro- 
duction values [TJ- More feasible trellis codes are instead 
obtained by reducing and structuring the codebooks. Trellis- 
coded quantization (TCQ) represents a successful attempt in 
this direction 0. 

TCQ has first appeared as a natural counterpart, in the 
source coding framework, of trellis-coded modulation 0. 
The basic TCQ scheme uses an extended codebook of 2 R+1 
reproduction values to code a source at a rate of R bit/sample. 
This codebook is divided into four sub-codebooks that are 
assigned to the branches of a trellis diagram by means of a 
(2, 1) binary convolutional code, and the Viterbi algorithm is 
used to find the minimum mean-squared-error path through the 
trellis. Among the others, TCQ and TCQ-like coding (0, 0) 
have found application in image coding [6j, 0, in scalable 
coding |8| and in multiple description coding 0. 

Since the choice of the actual reproduction values is respon- 
sible for the achievable performance [10], several algorithms 
have been proposed to find the best codebook. For instance, 
in the original paper an iterative algorithm is given for 
alphabet optimization that improves on the uniform distribu- 
tion. In other approaches, in an attempt to achieve some of the 
boundary gain too [11], the occurrences of each reconstruction 
level are taken into account during the optimization (and the 
coding) procedure. Examples are given in lfl2l (that refines the 
work in |[T3ll ) and in fl4l . where the concept is more elegantly 
approached via Lagrangian optimization. 

Observing that the size of the reproduction alphabet puts (at 
rates close to log 2 |-4|, with |.4| the size of the alphabet) a non- 
negligible limit on the achievable rate-distortion performance 
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fl5l . recent approaches fl6l . ifTTl have investigated the op- 
portunity to use larger codebooks for improved performance. 
In particular, in IfTTl a linear congruential recursion is used to 
randomize with a very small computational burden the labels 
associated with the trellis branches. 

The performance improvement over the basic TCQ systems 
was eventually obtained by taking into account for the rate 
(i.e. by entropy coding) or by increasing the reproduction set, 
but essentially keeping the trellis structure of a convolutional 
binary code for codeword randomization. As observed in [17], 
in fact, there seems to be no performance gain from changing 
the very simple shift-register transitions. 

Despite the fact that eventually TCQ heavily relies on 
convolutional codes, the current know-how about the algebraic 
properties of the convolutional codes has not been considered 
for the design of TCQ systems, which are instead based on 
ad-hoc convolutional codes (such as the ones in 0). In this 
letter a TCQ design technique based on maximum-Hamming- 
distance convolutional codes is investigated for high rate TCQ. 
The obtained codes are shown to be competitive with the ones 
in the literature for the same computational complexity^ 

II. Maximum-Hamming-Distance-Based 
Trellis-Coded Quantizer Design 

Suppose that the source to be quantized produces sequences 
of independent and identically distributed samples. According 
to the generally accepted Gersho's conjecture fl9| , the Voronoi 
cells of the best vector quantizer are all asymptotically con- 
gruent to the same polytope, with the possible exception of 
the ones touching the boundary of the typical set over which 
the source outcomes are uniformly distributed. At high rates, 
i.e. when most Voronoi cells lie in the interior of the typical 
set, it is then reasonable that all granular gain ifTTIl can be 
achieved by means of a geometrically uniform code [20|. 
Successively, entropy coding achieves all boundary gain IfTTIl . 
independently of the source distribution. 



If the extended codebook S C 



is a geometrically 



uniform set that can be partitioned into the cosets of S' < S, 
and S/S' is isomorphic to the group ({0, l} 2 , ®), then TCQ 
defines a geometrically uniform code 1201 . In particular, it 
is the generalized coset code C(S/S', Cl-i) C M. nl , where 
Cl-i are the codewords of maximum degree L— 1 of the used 
(2, 1) convolutional code ETIl . The rate-distortion performance 
of C(S/S',Cl-i) depends on the reciprocal position of 

'This fact seems to show that the folk channel coding result stating 
essentially that all "sensible" labelings lead to nearly equivalent performance 
1181 holds as well in the source coding domain. 
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the coset representatives of the partition C(S/S', Ci,-\)/S' L 
which are the closest to the null element 0^, of S' L . The aim 
is hence to maximize the reciprocal distance of these coset 
representatives. 

A. Distance Preserving Labelings 

Since the representatives of the different cosets are related 
to the different paths on the trellis, and then to different 
codewords of the underlying convolutional code, it is worth to 
investigate if there exist systems that associate more distant 
coset representatives (in Euclidean signal space) to more 
distant codewords (i.e. with large Hamming-distance). In such 
a case, distance-optimal binary convolutional codes would lead 
to generalized coset codes with good distance properties. 

Any coset representative of the partition 
C(S/S',Cl-i)/S' L is simply an L-tuple (so, si, . . . , Sl-i) 
of points of S such that s n € ju(a n ), n — 0, 1, . . . , L — 1, 
where \i is the isometric labeling [20] of the partition S/S' 
and (do, Oi, • • • , Ol-i) is the codeword of the underlying 
convolutional code which specifies the considered coset. 
Since the (squared) Euclidean-distance is additive, among 
the possible coset representatives of a coset the closest 
to the point Ojt, is the one for which s„ S p,{a n ) is the 
closest to Oss for each n = 0, 1, . . . , L — 1, Let us denote 
with S the subset of S which exactly contains all the coset 
representatives of the partition S/S' which are the closest to 
0s/. 

Like (squared) Euclidean-distance, Hamming-distance is 
additive. To assign more distant coset representatives to more 
distant codewords it is then sufficient for /i to label the 
elements of S (and hence of S) with binary tuples such that the 
closer two points are in TV-dimensional Euclidean space, the 
smaller is the Hamming-distance between the corresponding 
labels. A binary isometric labeling satisfying this property has 
been called a distance preserving labeling. In the following 
common cases such a labeling exists. 

• One-Dimensional Partition Z/4Z. Consider, for N = 1, 
the partition S/S' — Z/4Z, that corresponds to the 
(scaled and) infinitely extended alphabet case obtained 
from the scalar example in 0. Usually, the partition is 
labeled as in Fig. |l(a)| and ad-hoc (1,2) binary convo- 
lutional codes are used. A possible distance preserving 
labeling is shown in Fig. |l(b)| labels differing by one or 
two bits are respectively given to cosets whose minimum 
(squared) Euclidean-distance equals one or four. 
> Two-Dimensional Partition Z 2 /2Z 2 . For N = 2, consider 
the partition S/S' — Z 2 /2Z 2 , that again corresponds 
to the (scaled and) infinitely extended alphabet case 
obtained from the two-dimensional example in (2). A 
distance preserving labeling is shown in Fig. |l(c)| la- 
bels differing by one or two bits are respectively given 
to cosets whose minimum (squared) Euclidean-distance 
equals one or two. 

III. Simulation Results 

At high rates, and assuming that the symbols output by 
the trellis-coded quantizer are entropy-coded down to their 
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Fig. 1. (a,b) Two isometric labelings for the partition Z/4Z; (c) a distance 
preserving labeling for the partition 1? jlT? . 

entropy, the asymptotic performance of geometrically uniform 
TCQ is measured by the asymptotic normalized second mo- 
ment (per dimension) of the Voronoi cell of C(S/S', Cl-i), 
independently from both the rate and the probability distribu- 
tion of the source Hill . Equivalently, the performance can be 
expressed in terms of coding gain w.r.t. entropy-coded uniform 
quantization by the asymptotic value of the granular gain, 
j g , which simply represents the factor of reduction of the 
normalized second moment obtained by shaping an hypercube 
into the shape of the Voronoi cell of the trellis-coded quantizer. 

In the following, hence, the granular gain of the traditional 
TCQ systems found in the literature with the granular gain of 
maximum-Hamming-distance-based systems having the same 
number of states (i.e., exactly the same computational com- 
plexity) are compared. 



A. Experimental Setup 

The approximated second moment of a Voronoi cell can be 
evaluated only by simulation. If an LA^-dimensional region 
containing several cells (along each coordinate) is uniformly 
populated by N v random vectors (sequences), the average per 
dimension energy of the corresponding quantization errors 
approximates the second moment (per dimension) P(C) itself @ 

2 In fact, the great number of cells guarantees that there are no border effects 
in this evaluation. 
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TABLE I 

Asymptotic granular gain (in dB) for the partitions Z/4Z 

(TOP-HALF) AND I? /2I? (BOTTOM-HALF); LN = 1000 SAMPLES. 



states 


Ungerboeck codes 


distance-optimal codes 




code 


granular gain 


code 


granular gain 


4 


[5 2] 


1.005 ± .0018 


[5 7] 


1.003 ± .0018 


8 


[13 04] 


1.097 ± .0016 


[13 17] 


1.098 ± .0016 


16 


[23 04] 


1.162 ± .0014 


[23 35] 


1.169 ± .0014 


32 


[45 10] 


1.232 ± .0013 


[53 75] 


1.233 ± .0013 


64 


[103 024] 


1.282 ± .0012 


[133 171] 


1.290 ± .0012 


128 


[235 126] 


1.332 ± .0010 






256 


[515 362] 


1.369 ± .0010 


[561 753] 


1.375 ± .0010 


1024 






[2335 3661] 


1.428 ± .0008 


4 


[5 2] 


0.981 ± .0018 


[5 7] 


0.980 ± .0019 


8 


[13 04] 


1.074 ± .0017 


[13 17] 


1.074 ± .0017 


16 


[23 04] 


1.124 ± .0016 


[23 35] 


1.137 ± .0015 


32 


[45 10] 


1.171 ± .0015 


[53 75] 


1.191 ± .0015 


64 


[103 024] 


1.225 ± .0014 


[133 171] 


1.243 ± .0014 


128 


[235 126] 


1.272 ± .0014 






256 


[515 362] 


1.303 ± .0013 


[561 753] 


1.311 ± .0013 


1024 






[2335 3661] 


1.351 ± .0013 



Hence, the quantity 



JV„-1 



LN-l 
— V (X 

LN f-^ 

3=0 



(1) 



can be evaluated, where x% t j is the j-th realization of the 
z-th random sequence (and Xij is its reconstruction). The 
fidelity of this approximation can be measured as follows. 
Note that for high values of LN, one can appeal to the central 
limit theorem and hence assume that the expression in square 
brackets in ([TJ is a Gaussian r.v. belonging to AT(P(C),<r 2 ). 
Consequently, P(C) G N{P{C), <r 2 /N v ) and in the 95% of 
the cases 



P(C) - P(C) 



< 2- 



AP 



/N v ^N v 

where the best unbiased estimate of the standard deviation 
given the N v measured distortion samples can be used as a. 

In the experiments, the granular gain is evaluated with N v = 
5000 sequences of length LN — 1000, randomly distributed 
in the hypercube with sides equal to 2 R ■ 2 and to 2 R ■ \/2 
respectively for the case Z/4Z and the case Z 2 /2Z 2 , with 
R = 8. 

B. Results at High Rates 

The top-half of Table |T] compares the granular gains of 
the Z/4Z-TCQ systems in (2) with the ones of the proposed 
systems that are based on the labeling of Fig. |l(b)| For each 
number of states, the codes are shown in octal form. The codes 
shown in the fourth column of Table Q] are distance-optimal 
(2, 1) convolutional codes taken from [21|. The values in the 
left hand side of Table U confirm the results at high rates of 

and en. 

For most number of states, the proposed codes exactly 
achieve the same granular gain. In particular, the codes with 




Fig. 2. Experimental granular gain for the partition Z 2 /2Z 2 , using 16-, 
32-, 64- and 256-state distance-optimal codes. The dashed lines represent the 
granular gain obtained using the Ungerboeck codes with the same number of 
states. 



4 and 8 states lead to systems exactly equivalent to the corre- 
sponding ones based on Ungerboeck codes. A measurable, yet 
almost negligible improvement is instead measured for codes 
with 16, 64, and 256 states. 

The 1024-states code [2335 3661] is asymptotically only 
about 0.1 dB farther from the ultimate granular gain of 
1.53 dB that separates entropy constrained scalar quantization 
from the Shannon lower bound. This means that this trellis- 
coded quantizer, at high rates, almost fills the gap between 
the performance of scalar quantization and the theoretic rate- 
distortion curve. 

The Z 2 /2Z 2 -based TCQ systems are compared in the 
bottom-half of Table U In these cases each successive couple of 
samples is constrained on a squared lattice (a lattice translate 
of 2Z 2 ), and the convolutional codes can only randomize the 
relative position of the successive couples of samples. Conse- 
quently, compared to the Z/4Z case, where the convolutional 
code offers the possibility to control the relative position of 
each successive sample, giving more freedom to the system, 
the overall performance is slightly reduced. 

However, improvements are again obtained over the systems 
usually found in literature. The asymptotic coding gain is in 
fact higher for all the codes with 16, 32, 64, and 256 states. 
Close examination of the granular gain of these codes for 
different numbers of coded samples, compared with the gran- 
ular gain of the traditional systems using the corresponding 
codes (see Fig. 13, shows that they give improvements even 
for LN < 1000. 

C. Results at Low Rates 

Since the asymptotic gains describe only the high rate 
performance, it is interesting to investigate if the proposed 
codes are competitive with the traditional ones even at low 
rates. By simply limiting the extended codebook to only 2 R+1 
consecutive elements of 0.5 + Z (assuming a symmetric source 
distribution with respect to zero) it would be possible to 
achieve any (integer) desired limited bit rate. However, the 
performance increases if such a finite alphabet is optimized 
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TABLE II 

Comparison of the signal-to-quantization noise ratio relative 
TO the traditional and the proposed codes (using M states), 
for Z/4Z-TCQ; LN = 1000 uniformly (top-half) and Gaussian 

(BOTTOM-HALF) DISTRIBUTED SAMPLES . 



rate 


M 


SQNR (dB) 


bit/sample 




traditional codes 


proposed codes 




16 


6.405 ± .0029 


6.412 ± .0029 


1 


64 


6.496 ± .0028 


6.519 ± .0027 




256 


6.579 ± .0027 


6.593 ± .0026 




16 


12.802 ± .0023 


12.812 ± .0023 


2 


64 


12.915 ± .0021 


12.936 ± .0021 




256 


13.012 ± .0019 


13.022 ± .0019 




16 


19. 024 ±.0019 


19.033 ± .0019 


3 


64 


19.142 ± .0017 


19.157 ± .0017 




256 


19.236 ± .0015 


19.243 ± .0015 




16 


5.285 ± .0060 


5.310 ± .0061 


1 


64 


5.440 ± .0060 


5.478 ± .0060 




256 


5.570 ± .0059 


5.583 ± .0059 




16 


10.778 ± .0071 


10.796 ± .0072 


2 


64 


10.927 ± .0072 


10.948 ± .0072 




256 


11.035 ± .0072 


11.043 ± .0072 




16 


16.412 ± .0087 


16.431 ± .0085 


3 


64 


16.560 ± .0082 


16.578 ± .0081 




256 


16.673 ± .0087 


16.683 ± .0087 



according to some training data set, while the encoding 
complexity remains essentially the same. 

The comparison, relative to the optimized finite alphabets 
with R equal to 1, 2, and 3 bit/sample, is shown in Table [TT] 
for both the uniform and the Gaussian distribution. 

The results show a similar improvement with respect to 
the TCQ systems in for the codes with 16, 64, and 
256 states. Distance-optimal convolutional codes are hence 
able to distribute the reconstruction values in a maximal- 
distance fashion even in non-regular arrays. If the burden 
of randomizing the labels of the trellis and increasing the 
codebook size is not wanted (see fl6l . H3), then the proposed 
systems are a feasible alternative to traditional trellis-coded 
quantizers. 

IV. Conclusion 

In this letter an approach to TCQ design has been presented 
that is based on coupling a distance preserving labeling 
of the extended codebook with maximum-Hamming-distance 
convolutional codes. In this way, the relation between the 
minimum distance of the feasible reconstructed sequences and 
the Hamming-distance of the underlying convolutional code is 
exposed. 

Results show that it is possible and somewhat advantageous 
to conduct the search for good trellis codes into the binary 
domain, i.e. to look for good convolutional codes. In fact, 
the codes found are competitive with or somewhat better than 
the ones in literature. In the Z/4Z case better codes have 
been found for the 16, 64, and 256 states case, while in the 
Z 2 /2Z 2 case better codes have been found for the 16, 32, 64, 
and 256 states cases. In addition, a 1024 states code is given 



that, according to the authors' knowledge, was never published 
before in the context of TCQ. 
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